Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension
Abstract
BACKGROUND Biomedical scientists need access to figures to validate research facts and to formulate or test novel research hypotheses. However, figures are difficult to comprehend without associated text (e.g., the figure legend and other reference text). We are developing automated systems that extract the relevant explanatory information along with the figures from full-text articles. Such systems could improve figure retrieval and reduce the workload of biomedical scientists, who otherwise have to retrieve and read the entire full-text journal article to determine which figures are relevant to their research. As a crucial step, we studied the importance of associated text in biomedical figure comprehension.
METHODS Twenty subjects evaluated three figure-text combinations: figure+legend, figure+legend+title+abstract, and figure+full-text. Using a Likert scale, each subject scored each figure-text combination according to how well he or she thought he or she understood the meaning of the figure, and rated his or her confidence in the assigned score. Each subject also entered a free-text summary for each figure-text combination. We identified missing information using indicator words present in the text summaries. Both the Likert scores and the missing information were analyzed statistically for differences among the figure-text types. We also evaluated the quality of the text summaries with ROUGE, a text-summarization evaluation metric.
RESULTS Our results showed statistically significant differences in figure comprehension when varying levels of text were provided. When the full-text article was not available, presenting just the figure+legend left biomedical researchers lacking 39-68% of the information about a figure relative to complete figure comprehension; adding the title and abstract improved the situation, but still left biomedical researchers missing 30% of the information. When the full-text article was available, figure comprehension increased to 86-97%; that is, researchers felt that only 3-14% of the information necessary for full figure comprehension was missing. Clearly, the abstract and the full text contain information that biomedical scientists deem important for understanding the figures that appear in full-text biomedical articles.
CONCLUSION We conclude that the text in full-text biomedical articles is useful for understanding the meaning of a figure, and that an effective figure-mining system needs to unlock the information beyond the figure legend. Our work provides important guidance for figure-mining systems that extract information only from the figure and its legend.
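The METHODS mention scoring the free-text summaries with ROUGE but do not say which variant or implementation was used. As a rough illustration only, the sketch below computes ROUGE-1 (unigram overlap) between a subject's summary and a reference text. The function names `tokenize` and `rouge_1`, the simple alphanumeric tokenizer, and the example sentences are assumptions made for this sketch, not details taken from the study.

```python
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric word tokens.

    Assumption: the study's actual tokenization is not described.
    """
    return re.findall(r"[a-z0-9]+", text.lower())


def rouge_1(candidate: str, reference: str) -> dict[str, float]:
    """Return ROUGE-1 recall, precision, and F1 using clipped unigram counts."""
    cand_counts = Counter(tokenize(candidate))
    ref_counts = Counter(tokenize(reference))

    # Counter intersection keeps the minimum count per token (clipping).
    overlap = sum((cand_counts & ref_counts).values())

    ref_total = sum(ref_counts.values())
    cand_total = sum(cand_counts.values())
    recall = overlap / ref_total if ref_total else 0.0
    precision = overlap / cand_total if cand_total else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"recall": recall, "precision": precision, "f1": f1}


if __name__ == "__main__":
    # Hypothetical summary and reference text, for illustration only.
    summary = "the figure shows protein expression"
    reference = "the figure shows increased protein expression in cells"
    print(rouge_1(summary, reference))
```

In this hypothetical example the summary covers five of the eight reference tokens, so recall is 0.625 while precision is 1.0. A low recall of this kind is one way a summary written from limited text could be shown to miss information.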
Similar resources
Classification of Figures in Biomedical Literature toward a Figure Finding System
As biomedical full-text papers are becoming more available in digitized form on-line, there is a need for tools to mine information from all parts in the papers. Notably, since figures and their legends/captions in biomedical papers provide important information about research outcomes, mining techniques targeting them have attracted a great deal of attention. However, even a simple-sounding ta...
Figure-Associated Text Summarization and Evaluation
Biomedical literature incorporates millions of figures, which are a rich and important knowledge resource for biomedical researchers. Scientists need access to the figures and the knowledge they represent in order to validate research findings and to generate new hypotheses. By themselves, these figures are nearly always incomprehensible to both humans and machines and their associated texts ar...
Correction: shaped magnetic field pulses by multi-coil repetitive transcranial magnetic stimulation (rTMS) differentially modulate anterior cingulate cortex responses and pain in volunteers and fibromyalgia patients
Correction In our recently published article [1] we made a mistake in describing the contribution of two of the authors. In particular, we indicated that AT helped in designing the study, carried out psychophysical testing, analyzed data and drafted the manuscript and that DCY designed the study, carried out psychophysical testing, analyzed data and helped in drafting the manuscript. This was a...
DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures
Hundreds of millions of figures are available in biomedical literature, representing important biomedical experimental evidence. Since text is a rich source of information in figures, automatically extracting such text may assist in the task of mining figure information. A high-quality ground truth standard can greatly facilitate the development of an automated system. This article describes De...
Automatic Figure Ranking and User Interfacing for Intelligent Figure Search
BACKGROUND Figures are important experimental results that are typically reported in full-text bioscience articles. Bioscience researchers need to access figures to validate research facts and to formulate or to test novel research hypotheses. On the other hand, the sheer volume of bioscience literature has made it difficult to access figures. Therefore, we are developing an intelligent figure ...